DETAILED ACTION
Continued Examination Under 37 CFR 1.114 

1. A request for continued examination under 37 CFR 1.114, including the fee set forth in
37 CFR 1.17(e), was filed in this application after final rejection. Since this application is
eligible for continued examination under 37 CFR 1.114, and the fee set forth in 37 CFR 1.17(e)
has been timely paid, the finality of the previous Office action has been withdrawn pursuant to
37 CFR 1.114. Applicant's submission filed on January 22, 2009 has been entered.

2. Claim 106 has been amended. 

3. Claims 106, 109 and 111 are currently under consideration as drawn to the species
mutation site 5653 of the plexinB1 coding sequence and the A5653G mutation.

Rejections Maintained 
Claim Rejections - 35 USC § 112 
The following is a quotation of the second paragraph of 35 U.S.C. 112:

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

3. Claims 106, 109, and 111 remain rejected under 35 U.S.C. 112, second paragraph, as
being indefinite for failing to particularly point out and distinctly claim the subject matter which
applicant regards as the invention, essentially for the reasons set forth in the Office Action of
October 27, 2008, section 3, pages 2-3.
Examiner argued: 

Claim 107 refers to one or more mutations in a region of the nucleic acid which encodes
the cytoplasmic domain of the plexinB1 polypeptide. Claim 108 refers to one or more mutations
at site 5653 of the plexinB1 coding sequence and claim 109 refers to the mutation A5653G.
However, given that there is no point of reference given as to where the cytoplasmic domain of
plexinB1 begins or ends and there is no point of reference given as to where the mutations of
claims 108 and 109 are located, such as a SEQ ID NO: for plexinB1, the claims are indefinite as it
cannot be determined where these mutations are located.

Applicants argue that they believe the amendments will obviate this rejection. 

Applicants' arguments have been considered, but have not been found persuasive because
the mutations of claim 106 encompass the mutations of claim 109 and the mutations of claim
109 are not limited to SEQ ID NO: 112. Furthermore, SEQ ID NO: 112 is not AB0007867.1,
but AB007867.1, see Appendix 1. Thus, it cannot be determined to which coding sequence the
claims are drawn: SEQ ID NO: 112 or AB0007867.1. Given its broadest reasonable
interpretation, the claim is drawn to SEQ ID NO: 112 or AB0007867.1, and the location of the
mutations in AB0007867.1 is indefinite for the reasons previously set forth.

The following is a quotation of the first paragraph of 35 U.S.C. 112:

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

4. Claims 106, 109 and 111 remain rejected under 35 U.S.C. 112, first paragraph, as failing
to comply with the enablement requirement for the reasons set forth in section 4, pages 3-10 of
the Office Action of October 27, 2008.
Examiner argued: 

One cannot extrapolate the teachings of the specification to the enablement of the claims
because one of skill in the art would not predictably be able to use changes in either the wild-type
plexinB1 or the A5653G mutant to identify or obtain a putative anti-cancer agent. Although the
A5653G mutation is found in primary and metastatic prostate tumors, this same mutant plexinB1
reduces the tumorigenicity of cells in vivo. Thus, it is not clear if this mutation is a positive or
negative regulator of prostate tumor or any tumor formation, as the mutation appears to be
associated with both positive and negative regulation of tumor formation, and one of skill in the
art would not predictably know what change in expression of the A5653G mutant plexinB1 nucleic
acid would be important for affecting tumor formation and would not predictably be able to
identify and/or obtain a compound as a putative anti-cancer agent based on a change in expression
of the A5653G mutant plexinB1. Thus, undue experimentation would be required for identifying
and/or obtaining a putative anti-cancer agent by the claimed method.

Additionally, it is not predictable that determining an increase in the wild-type plexinB1
would lead to the identification of a putative anti-cancer agent. Although the specification
teaches that the expression of the wild-type plexin B1 suppresses tumor formation, Mack and
Gish (US Pat. App. Pub. 2004/0005563, June 17, 2002) teach that plexin B1 is upregulated in
ovarian cancer, see Table 14A and para. 0348 of the published application, and Vogelstein et al.
(US Pat. App. Pub 2005/0047996, October 9, 2001) teach that plexin B1 is upregulated in
colorectal cancer, see Table 1. Thus, given that plexin B1 is upregulated in ovarian and
colorectal cancers, the determination of an increase in the wild-type plexin B1 by a test
compound would not predictably identify a putative anti-cancer agent. Thus, undue
experimentation would be required for identifying and/or obtaining a putative anti-cancer agent
by the claimed method.

Furthermore, given that A5653G mutant plexin B1 has only been identified in prostate
cancers, one of skill in the art would not predictably expect that agents that affect the expression
of this mutant plexinB1 nucleic acid would be putative anti-cancer agents for any cancer because
it is well known in the art that cancers are heterogeneous in phenotype and genes expressed and
cancer therapeutics are not predictably effective for all cancers.

In particular, cancers comprise a broad group of malignant neoplasms divided into two
categories, carcinoma and sarcoma. The carcinomas originate in epithelial tissues while
sarcomas develop from connective tissues, see Taber's Cyclopedic Medical Dictionary (1985,
F.A. Davis Company, Philadelphia, p. 274). Given that not all cancers originate from the same
tissue types, it is known that cancers that originate from different tissue types have different
structures as well as etiologies and would present differently. Thus, it would not be predictably
expected that a nexus, for example drawn to a connection between the A5653G mutant plexin B1
and prostate cancer, would be established between two cancer types that arose from different
tissue types. Further, it is well known that even two carcinomas that present on the same organ
have significant differences in etiology and genetic constitution. For example, Busken, C et al.
(Digestive Disease Week Abstracts and Itinerary Planner, 2003, abstract No:850) teach that
there is a difference in COX-2 expression with respect to intensity, homogeneity, localization
and prognostic significance between adenocarcinoma of the cardia and distal esophagus,
suggesting that these two cancers have different etiology and genetic constitution (last five lines
of the abstract). Additionally, Kaiser (Science, 2006, 313: 1370) teaches that in a genomic
analysis of mutations in breast and colon cancers, it was found that the cancer genes differ
between colon and breast cancers and each tumor had a different pattern of mutations.
Kaiser teaches that the steps to cancer may be more complex than had been anticipated, see 3rd
col. Furthermore, Krontiris and Capizzi (Internal Medicine, 4th Edition, Editor-in-chief Jay
Stein, Elsevier Science, 1994, Chapters 71-72, pages 699-729) teach that the various types of
cancers have different causative agents, involve different cellular mechanisms, and,
consequently, differ in treatment protocols. Chemotherapeutic agents are frequently useful
against a specific type of neoplasm and there are no drugs broadly effective against all forms of
cancer, see Carter, S. K. et al. Chemotherapy of Cancer; Second edition; John Wiley & Sons:
New York, 1981; appendix C. Given the above, it is clear that it is not possible to predictably
extrapolate any potential correlation between an A5653G mutant plexin B1 directed anti-cancer
agent and prostate cancer sensitivity to such an agent in any tumor type based on the
information in the specification and known in the art without undue experimentation.

Furthermore, one of skill in the art would not predictably expect all of the broadly
claimed mutants of plexinB1 to be associated with cancer, and thus an effect on their expression
would not predictably be useful for identifying a compound as a putative anti-cancer agent. It is
noted that the specification teaches that a mutant plexinB1 nucleic acid may comprise a
nucleotide sequence which has one or more mutations relative to the wild-type plexinB1
nucleotide sequence, as set out in AB007867. The mutations may be deletions, insertions or
substitutions of one or more nucleotides, see para. 0014 of the published application. Given the
above and given that claims are drawn to contacting "a" plexin B1 nucleic acid, which reads on
fragments, which comprises one or more mutations in a coding region of the nucleic acid, the
broadest reasonable interpretation of the claims is that the claims are not limited to any specific
plexinB1 mutants and the plexinB1 mutants can comprise nucleic acids that are completely
distinct from plexinB1. Furthermore, given claims 108 and 109 are indefinite in lacking a point
of reference, these claims are also not limited to a particular site of mutation within the coding
region of the plexinB1 nucleic acid and the plexinB1 mutants can comprise nucleic acids that are
completely distinct from plexinB1.

It would not be expected that such a diverse array of mutants of plexin B1 would
predictably be associated with cancer given that even naturally occurring gene variants,
such as splice variants, do not predictably have the same expression pattern or encode proteins
with the same function as the related variants. In particular, Benedict et al (J. Exp.
Medicine, 2001, 193(1) 89-99) specifically teach that two splice isoforms of terminal
deoxynucleotidyl transferase (a long form and a short form) enter the nucleus but have
different activity; the long form does not catalyze nontemplated nucleotide addition but
rather modulates the activity of the short form (see abstract). Jiang et al (JBC, 2003,
278(7) 4763-4769) specifically teach that the type 3 Ca2+ release channel, RyR3, exhibits
strikingly different pharmacologic and functional properties depending on the tissues in
which it resides. Upon examination, seven tissue-specific alternatively spliced variants of
RyR3 were detected. One of the variants was unable to form a functional channel but was
able to suppress the activity of a different release channel. The authors conclude that
tissue-specific expression of RyR3 splice variants is likely to account for some of the
pharmacologic and functional heterogeneities of RyR3 (see abstract). The abstract of
Matsushita et al (FEBS Letters, 1999, Vol. 443, pp. 348-352) teaches that latrophilins
exhibit alternative splicing resulting in latrophilin-1, which is present in brain and
endocrine cells, latrophilin-2, which is ubiquitous, and latrophilin-3, which is
brain-specific. The abstract of Singh et al (Glycobiology, 2001, Vol. 11, pp. 587-592) teaches
that the CD44 splice variant, CD44v, is the major PNA-binding glycoprotein in colon
cancer cells in contrast to standard CD44. These references serve to demonstrate that one
of skill in the art cannot anticipate the biological activity of the proteins encoded by the
broadly claimed plexinB1 mutants or the tissue distribution of the claimed mutants based
on the biological activity of the protein encoded by the wild-type or tissue distribution of
the wild-type nucleic acid or other mutants of plexinB1. Thus, even if it were found that
the examination of the expression of the A5653G plexin B1 mutant could be used as
claimed, undue experimentation would be required to use the broadly claimed mutants or
even other mutations at position 5653 for the identification of putative anti-cancer agents.

The specification provides insufficient guidance with regard to the issues set forth
above and provides insufficient working examples which would provide guidance to one
skilled in the art, and insufficient evidence has been provided which would allow one of
skill in the art to predict that the invention would function as claimed with a reasonable
expectation of success. For the above reasons, it appears that undue experimentation would
be required to practice the claimed invention.

Applicants argue that the Section 112, first paragraph "enablement" rejection of claims
106, 109 and 111 is traversed. Reconsideration and withdrawal of the rejection are requested in
view of the following comments and the attached Wong et al ("Plexin-B1 mutations in prostate
cancer" PNAS November 27, 2007, vol. 104, no. 48, 19040-19045).

Applicants argue that the Examiner is understood to believe that the specification lacks 
experimental data which shows that the claimed mutations are involved in the etiology of cancer, 
so one of skill in the art could, according to the Examiner, not predictably use the claimed 
methods for identification of an anticancer drug without undue experimentation. 

Applicants argue that the Examiner is requested to see the attached Wong et al, which is a
peer-reviewed publication co-authored by the present inventors which contains the mutation
data which is set out in the instant specification. Wong et al also contains additional data which
shows the functional effects of four separate plexinB1 mutations (A5359G; A5653G; T5714C
and C5060T) in cultured cells.

Applicants argue that all four plexinB1 mutants were shown to decrease the shrinkage or
collapse of COS-7 cells relative to wild-type plexinB1 (Wong et al; figure 3c) and to
significantly increase the adhesion of HEK293 cells relative to wild-type plexinB1 (Wong et al;
figure 3d).

Applicants argue that furthermore, plexinB1 mutation was also shown to significantly
increase the rate of migration of HEK293 cells relative to wild-type plexinB1 (Wong et al; figure
4a) and to increase the invasive capacity of HEK293 cells relative to wild-type plexinB1 (Wong
et al; figure 4b). Expression of plexinB1 mutants in HEK293 cells was also shown to
significantly increase the percentage of cell spreading and average cell size relative to expression
of wild-type plexinB1 (Wong et al; figures 5a and 5b).

Applicants argue that in addition, mutation of plexinB1 is also shown to inhibit RacGTP
and R-Ras binding (Wong et al; figures 5c, 6a and 6b), which may contribute to the observed
increase in cell adhesion and motility (Wong et al; page 19044 col. 1 2nd para).

Applicants argue that the functional data set out in Wong et al provides further
confirmation that plexinB1 mutation is functionally important in the etiology of cancer, and in
particular cancer progression. For example, Wong et al states at page 19044 col. 1;

Together these results suggest that Plexin-B1 has a role in prostate cancer progression.

Applicants argue that Wong et al further state the following at page 19044 col. 2; 

Plexin-B1 is likely to be a key player in cancer invasion and metastasis and is a potential
target for anticancer therapy.

Applicants argue that it is therefore evident that plexinB1 mutations are involved in the
etiology of cancer. The claimed methods could therefore be predictably used by one of ordinary
skill in the art for identifying a compound as a putative anti-cancer agent.

Applicants' arguments have been considered, but have not been found persuasive because
the functional studies of Wong et al. of the plexin B1 are based on in vitro studies in cell lines,
which do not predictably extrapolate to in vivo anti-cancer activity. In particular, the
characteristics of cultured cell lines generally differ significantly from the characteristics of the
primary tumor. As discussed in Freshney (Culture of Animal Cells, A Manual of Basic
Technique, Alan R. Liss, Inc., 1983, New York, p. 4), it is recognized in the art that there are
many differences between cultured cells and their counterparts in vivo. These differences stem
from the dissociation of cells from a three-dimensional geometry and their propagation on a
two-dimensional substrate. Specific cell interactions characteristic of histology of the tissue are lost.
The culture environment lacks the input of the nervous and endocrine systems involved in
homeostatic regulation in vivo. Without this control, cellular metabolism may be more constant
in vitro but may not be truly representative of the tissue from which the cells were derived. This
has often led to tissue culture being regarded in a rather skeptical light (p. 4, see Major
Differences In Vitro). Further, Dermer (Bio/Technology, 1994, 12: 320) teaches that a petri dish
cancer is a poor representation of malignancy, with characteristics profoundly different from the
human disease. Dermer further teaches that when a normal or malignant cell adapts to immortal
life in culture, it takes an evolutionary-type step that enables the new line to thrive in its artificial
environment and thus transforms a cell from one that is stable and differentiated to one that is
not. The reference states that evidence of the contradictions between life on the bottom of a lab
dish and in the body has been in the scientific literature for more than 30 years. Clearly it is well
known in the art that cells in culture exhibit characteristics different from those in vivo and
cannot duplicate the complex conditions of the in vivo environment involved in host-tumor and
cell-cell interactions. Further, the art recognizes the problem of molecular artifacts associated
with cell culture. For example, Drexler et al (Leukemia and Lymphoma, 1993, 9:1-25)
specifically teach, in the study of Hodgkin and Reed-Sternberg cancer cells in culture, that the
acquisition or loss of certain properties during adaptation to culture systems cannot be excluded.
This is exemplified by the teachings of Zellner et al (Clin. Can. Res., 1998, 4:1797-1802) who
specifically teach that products are overexpressed in glioblastoma (GBM)-derived cell lines
which are not overexpressed in vivo. Drexler et al. further teach that only a few cell lines
containing cells that resemble the in vivo cancer cells have been established and even for the
bona fide cancer cell lines it is difficult to prove that the immortalized cells originated from a
specific cancer cell (see attached abstract). More recently, Zips et al (In vivo, 2005, 19:1-7)
specifically teach that despite their importance for drug testing, in vitro methods are beset by
pitfalls and inherent limitations (p. 3, col. 1). In particular the authors state that "It is obvious that
cells in culture represent an artificial and simplified system. Unlike the situation in vitro, a
tumor is a 3-dimensional complex consisting of interacting malignant and non-malignant cells.
Vascularisation, perfusion and thereby, drug access to the tumor cells are not evenly distributed
and in this fact consists an important source of heterogeneity in tumor response to drugs that
does not exist in vitro. Therefore, prediction of drug effects in cancer patients based solely on in
vitro data is not reliable and further evaluations in animal tumor systems is essential" (p. 3, col.
2).

Additionally, Clark et al. (US Pat. App. Pub. 2006/0019256, January 2006) teach that
"[a]lthough cell lines have led to remarkable advances in our understanding of the molecular and
biochemical changes in cancer cells, their use in the identification of effective cancer therapies is
somewhat limited. Cell lines are imperfect predictors of drug efficacy in de novo tumors.
Several factors likely account for this deficiency. Cancer cell lines are selected from a
sub-population of cancer cells that are specifically adapted to growth in tissue culture and the
biological and functional properties of these cell lines can change dramatically. Furthermore,
cancer cells from only a minority of breast cancer tumors establish cell lines or xenograft tumors.
The phenotypic and functional characteristics of these cell lines can change drastically relative to
their properties in vivo. For example, the marker expression of both normal hematopoietic and
leukemic tissue culture cells can change rapidly in tissue culture and often does not reflect that of
the original stem cells from which they were derived. Even when conditions are devised to
permit the proliferation of normal stem cells in culture, the conditions often promote self-renewal
or differentiation in a way that prevents the stem cells in culture from recapitulating the
hierarchy of cell populations that exist in vivo. Taken together, these observations suggest that
the biological properties of cell lines can differ markedly from the cancer cells from which they
were derived. This likely explains at least in part why the cell lines often are poor predictors of a
drug's efficacy in the clinic," see para. 0109.

Thus, given the above, the in vitro cell culture data presented by Wong et al. do not provide
enabling support for the claimed method, in the absence of data that the 5653 mutations affect
cancer growth in vivo, such as in an animal model system. Furthermore, the teachings of Wong et
al. are not commensurate in scope with the claimed method, as the claimed method encompasses
a much broader array of mutations than those examined by Wong et al. Thus, given the
unpredictability in the art previously set forth and above, the rejection is maintained for the
reasons previously set forth and above.

5. Claims 106, 109, and 111 remain rejected under 35 U.S.C. 112, first paragraph, as failing
to comply with the written description requirement, for the reasons set forth in the Office Action
of October 27, 2008, section 5, page 10.
Examiner argued:

The limitation of "the plexinB1 coding sequence of AB0007867.1" claimed in Claims
106, 109, and 111 has no clear support in the specification and the claims as originally filed. A
review of the specification has revealed support for AB007867.1, see page 8, line 29. Thus, the
subject matter claimed in Claims 106, 109, and 111 broadens the scope of the invention as
originally disclosed in the specification.

Applicants argue that they believe the amendment will obviate this rejection. 

Applicants' argument has been considered, but has not been found persuasive because the
claims are still drawn to "the plexinB1 coding sequence of AB0007867.1" and the specification
only refers to AB007867.1, e.g. see page 6, line 4. Additionally, a review of the specification and
claims as originally filed does not reveal support for SEQ ID NO: 112. Thus, the subject matter
claimed in Claims 106, 109, and 111 broadens the scope of the invention as originally disclosed
in the specification.

New Grounds of Rejection/Objection 
Priority 

6. Applicant's claim for the benefit of a prior-filed application under 35 U.S.C.
119(e) or under 35 U.S.C. 120, 121, or 365(c) is acknowledged. Applicant has not complied
with one or more conditions for receiving the benefit of an earlier filing date under 35
U.S.C. 119(e) or under 35 U.S.C. 120, 121, or 365(c) as follows:

The later-filed application must be an application for a patent for an invention which is
also disclosed in the prior application (the parent or original nonprovisional application or
provisional application). The disclosure of the invention in the parent application and in the
later-filed application must be sufficient to comply with the requirements of the first paragraph of 35
U.S.C. 112. See Transco Products, Inc. v. Performance Contracting, Inc., 38 F.3d 551, 32
USPQ2d 1077 (Fed. Cir. 1994).

The disclosure of the prior-filed applications fails to provide adequate support or
enablement in the manner provided by the first paragraph of 35 U.S.C. 112 for one or more
claims of this application. Examiner has established a priority date of 11/10/2005 for claims 106,
109, and 111 because the claims as currently constituted recite AB0007867.1 and SEQ ID NO:
112 and a review of the parent Applications does not reveal the claimed limitations. Applicant is
invited to submit evidence pointing to the serial number, page and line where support can be
found establishing an earlier priority date.

Specification 

7. The amendment filed December 24, 2008 is objected to under 35 U.S.C. 132(a)
because it introduces new matter into the disclosure. 35 U.S.C. 132(a) states that no amendment
shall introduce new matter into the disclosure of the invention. The added material which is not
supported by the original disclosure is as follows: SEQ ID NO: 111 and SEQ ID NO: 112.

Applicant is required to cancel the new matter in the reply to this Office Action.

Claim Objections

8. Claim 111 is objected to because of the following informalities: The claim does
not end with a period. Appropriate correction is required.

Claim Rejections - 35 USC § 112 

9. Claims 106, 109, and 111 are rejected as failing to comply with the written
description requirement. The claim(s) contains subject matter which was not described in the
specification in such a way as to reasonably convey to one skilled in the relevant art that the
inventor(s), at the time the application was filed, had possession of the claimed invention.

The claims are broadly drawn to a method of identifying a compound as a putative
anti-cancer agent, the method comprising; determining the expression of a plexinB1 nucleic acid in a
cell or cell lysate in the presence of a test compound, wherein said plexinB1 nucleic acid
comprises mutations in the coding region of the nucleic acid at position 5653 of the plexinB1
coding sequence of AB0007867.1 (SEQ ID NO: 112), and; wherein the cancer is prostate or
breast cancer. When given the broadest reasonable interpretation, a plexinB1 nucleic acid
encompasses any nucleic acid comprising a mutation of the sequence at position 5653 of
AB0007867.1 or SEQ ID NO: 112, given that the specification teaches that a mutant plexinB1
nucleic acid may comprise a nucleotide sequence which has one or more mutations relative to
the wild-type plexinB1 nucleotide sequence, as set out in AB007867, and the mutations may be
deletions, insertions or substitutions of one or more nucleotides (see the paragraph bridging
pages 3-4 of the specification as originally filed) and given that the coding sequence of
AB0007867.1 is not defined and is not SEQ ID NO: 112, see Appendix 1. In other words, the
claims encompass any nucleic acid comprising a mutation at position 5653 (or any other of the
claimed mutation sites) of AB0007867.1 or SEQ ID NO: 112, with AB0007867.1 being
undefined, and do not require the retention of any other plexinB1 sequences. Furthermore,
dependent claim 109 is not even limited to having the mutation be from AB0007867.1 or SEQ
ID NO: 112, the scope of which is encompassed by independent claim 106. Thus, the genus of
plexinB1 nucleic acids which comprise one or more mutations is highly variable, varying
significantly both in structure and function. The description of plexinB1 mutations (see Tables 1
and 2) in the specification fails to adequately describe the genus of plexin B1 mutations because
said genus tolerates members which differ significantly in both structure and function from the
plexinB1 nucleic acid. One of skill in the art can reasonably conclude that applicant was not in
possession of a genus of plexinB1 nucleic acids which comprise one or more mutations at the
time the invention was filed. Because the genus of plexinB1 nucleic acids which comprise one
or more mutations is not adequately described, the method claims relying on said genus are also
not adequately described.

As it is drawn to DNA arts, the findings in University of California v. Eli Lilly and Co.,
119 F.3d 1559, 43 USPQ2d 1398 (Fed. Cir. 1997) and Enzo Biochem, Inc. v. Gen-Probe Inc. are
relevant to the instant claims. The Federal Circuit addressed the application of the written
description requirement to DNA-related inventions in University of California v. Eli Lilly and
Co., 119 F.3d 1559, 43 USPQ2d 1398 (Fed. Cir. 1997). The court stated that "[a] written
description of an invention involving a chemical genus, like a description of a chemical species,
'requires a precise definition, such as by structure, formula, [or] chemical name,' of the claimed
subject matter sufficient to distinguish it from other materials." Id. at 1567, 43 USPQ2d at
1405. The court also stated that a generic statement such as "vertebrate insulin cDNA" or
"mammalian insulin cDNA" without more, is not an adequate written description of the genus
because it does not distinguish the genus from others, except by function. It does not specifically
define any of the genes that fall within its definition. It does not define any structural features
commonly possessed by members of the genus that distinguish them from others. One skilled in
the art therefore cannot, as one can do with a fully described genus, visualize or recognize the
identity of the members of the genus. A definition by function, as we have previously indicated,
does not suffice to define the genus because it is only an indication of what the gene does, rather
than what it is. Id. at 1568, 43 USPQ2d at 1406. The court concluded that "naming a type of
material generally known to exist, in the absence of knowledge as to what that material consists
of, is not a description of that material." Id.

In the instant case, the genus of plexinB1 nucleic acids which comprise one or more
mutations is so broad that it does not define the members that do or do not fall within the genus
and it does not define any structural features commonly possessed by members of the genus that
distinguish them from nucleic acids not encompassed by the genus.

In the remarks of July 11, 2008, Applicants argued that the claimed invention is described
in the specification in a manner that one of ordinary skill will appreciate that the applicants were
in possession of the claimed invention at the time the application was filed. The present claims
relate to a sub-genus of plexin B1 nucleic acids which contain a mutation located at one of a
number of specified positions in the coding sequence of plexinB1 identified by reference to the
sequence of database entry AB0007867.1.

Applicants argued that the description of plexinB1 mutations in the specification
(e.g. Tables 1 and 2) adequately describes the claimed invention, since one example of a plexin
B1 nucleic acid with a mutation at each position is disclosed. Furthermore, the applicants believe
that since the mutations are located at specified sites in the plexinB1 sequence, the claimed
invention does not include species which differ significantly in either structure or function from
these examples.

Applicants argued that the sites of mutation within the plexin B1 nucleic acids are
identified by reference to the plexin B1 sequence of database entry AB0007867.1, so the
positions of these mutations within the plexin B1 sequence can be readily determined.

Applicants argued that the subject-matter of the present claims is therefore described in 
the specification in such a way as to reasonably convey to one skilled in the relevant art that the 
inventors had possession of the claimed invention at the time the application was filed. 

Applicants' arguments have been considered, but have not been found persuasive because 
the claims are not limited to plexinB1 mutants of AB0007867.1 or SEQ ID NO: 12. The claims 
encompass a plexin B1 nucleic acid that comprises mutations in the coding region that comprise a 
single nucleic acid mutation at position 5653 (or any of the other claimed mutation sites) of 
AB0007867.1 or SEQ ID NO: 12. Furthermore, dependent claim 109 is not even limited to 
having the mutation be from AB0007867.1 (which is not defined) or SEQ ID NO: 112, the scope 
of which is encompassed by independent claim 106. Thus, the claims encompass nucleic acids 
that comprise no sequences related to AB0007867.1 or SEQ ID NO: 12. Thus, the description of 
a single plexinB1 and its mutations in the specification fails to adequately describe this vast 
genus of nucleic acids. 

Claim Rejections - 35 USC § 102 
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

10. Claims 106, 109, and 111 are rejected under 35 U.S.C. 102(b) as being anticipated 
by WO 02/068579 A2 (Venter et al. 6 September 2002). 
The claims are drawn to: 

106. A method of identifying a compound as a putative anti-cancer agent, the method 
comprising; determining the expression of a plexinB1 nucleic acid in a cell or cell lysate in the 
presence of a test compound, wherein said plexinB1 nucleic acid comprises mutations in the 
coding region of the nucleic acid at position 5653 of the plexinB1 coding sequence of 
AB0007867.1 (SEQ ID NO: 112), and; wherein the cancer is prostate or breast cancer. 

109. A method according to claim 106, wherein the one or more mutations is A5653G. 

111. A method according to claim 106, comprising determining a decrease in the 
expression of mutant plexin B1 in the presence of said test compound. 

When given the broadest reasonable interpretation, a plexinB1 nucleic acid encompasses 
any nucleic acid comprising a mutation of the sequence at position 5653 of AB0007867.1 or 
SEQ ID NO: 112, given that the specification teaches that a mutant plexinB1 nucleic acid may 
comprise a nucleotide sequence which has one or more mutations relative to the wild-type 
plexinB1 nucleotide sequence, as set out in AB007867, and that the mutations may be deletions, 
insertions or substitutions of one or more nucleotides (see the paragraph bridging pages 3-4 of 
the specification as originally filed), and given that the coding sequence of AB0007867.1 is not 
defined and is not SEQ ID NO: 112, see Appendix 1. In other words, the claims encompass any 
nucleic acid comprising a mutation of position 5653 (or any other of the claimed mutation sites) 
of AB0007867.1 or SEQ ID NO: 112, with AB0007867.1 being undefined, and do not require 
the retention of any other plexinB1 sequences. Furthermore, dependent claim 109 is not even 
limited to having the mutation from AB0007867.1 or SEQ ID NO: 112, the scope of which is 
encompassed by independent claim 106, and thus the A to G mutation could be anywhere in the 
mutant sequence. 
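
By way of illustration only, the ambiguity in "position 5653" can be sketched as follows. The sketch assumes the AB007867 record reproduced in Appendix 1, whose CDS feature is annotated as <247..6678, and uses only the 60-base row of that record covering positions 5641-5700. The function names and the two numbering conventions compared are assumptions made for the example and are not taken from the claims or the specification.

```python
# Row of the AB007867 record covering record positions 5641-5700 (copied from Appendix 1).
ROW_START = 5641
ROW = "GGAGTGCCTCTCACCCAGCGGCCAGACCCTCGCACCCTTGATGTTGAGTGGCGGTCTGGG"

# CDS feature of the record in Appendix 1: <247..6678 (start codon not identified).
CDS_START = 247
CDS_END = 6678


def base_at(record_pos: int) -> str:
    """Return the base at a 1-based record position that lies inside ROW."""
    offset = record_pos - ROW_START
    if not 0 <= offset < len(ROW):
        raise ValueError(f"position {record_pos} is outside the row held here")
    return ROW[offset]


def cds_to_record(cds_pos: int) -> int:
    """Map a 1-based position counted from the first annotated CDS base
    to the corresponding 1-based record position."""
    record_pos = CDS_START + cds_pos - 1
    if not CDS_START <= record_pos <= CDS_END:
        raise ValueError(f"CDS position {cds_pos} falls outside the annotated CDS")
    return record_pos


# Convention 1 (assumed): "5653" counts from the first base of the record.
print(base_at(5653))        # A

# Convention 2 (assumed): "5653" counts from the first annotated CDS base.
print(cds_to_record(5653))  # 5899
```

Under the first convention the referenced base in the Appendix 1 record is an A. Under the second convention the same number designates record position 5899, a different nucleotide, so the site depends on which reference numbering is intended.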

It is noted that the recitation of "a method of identifying a compound as a putative anti- 
cancer agent . . . wherein the cancer is prostate or breast cancer" in claim 106 is merely 
suggestive of an intended use that does not result in a structural difference between the claimed 
invention and the prior art in order to patentably distinguish the claimed invention from the prior 
art and thus is not given weight for comparison of the claims with the prior art. 

WO 02/068579 teaches a plexinB1 sequence that is mutated relative to SEQ ID NO: 12 
with a mutation at position 5653 and contains A to G mutations relative to SEQ ID NO: 12, see 
position 5579 for example, see Appendix 2. WO 02/068579 teaches determining the expression 
of the transcripts of the invention in cells in the presence of compounds in drug development and 
determining decreases in the expression of the transcripts in cells in the presence of the 
compounds under development, see page 30. 
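
As a hedged illustration of what "A to G mutations relative to" a reference sequence means in an ungapped alignment such as those in the appendices, the following sketch lists the positions at which the reference has A and the compared sequence has G. The function name and the short toy sequences are hypothetical; they are not taken from SEQ ID NO: 12, AB007867 or WO 02/068579.

```python
from typing import List


def a_to_g_sites(reference: str, variant: str, start: int = 1) -> List[int]:
    """Return the positions (numbered from `start`) where the reference has A
    and the variant has G. Both sequences must be aligned without gaps."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length (ungapped alignment)")
    return [
        start + i
        for i, (ref_base, var_base) in enumerate(zip(reference.upper(), variant.upper()))
        if ref_base == "A" and var_base == "G"
    ]


# Hypothetical toy sequences, for illustration only.
reference = "ATGACA"
variant = "GTGACG"
print(a_to_g_sites(reference, variant))             # [1, 6]
print(a_to_g_sites(reference, variant, start=100))  # [100, 105]
```

The `start` argument lets the positions be reported in the numbering of the row being compared, for example the left-hand coordinate of an alignment row.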

11. No claims allowed. 

12. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to PETER J. REDDIG whose telephone number is (571)272-9031. 
The examiner can normally be reached on M-F 8:30 a.m.-5:00 p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Larry Helms, can be reached on (571) 272-0832. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 


/Peter J Reddig/ 
Examiner, Art Unit 1642 

Appendix 1 


AB007867 

LOCUS       AB007867                7308 bp    mRNA    linear   PRI 10-JAN-2004 
DEFINITION  Homo sapiens KIAA0407 mRNA, partial cds. 
ACCESSION   AB007867 
VERSION     AB007867.1  GI:2662094 
KEYWORDS 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; 
            Catarrhini; Hominidae; Homo. 
REFERENCE   1 
  AUTHORS   Ishikawa,K., Nagase,T., Nakajima,D., Seki,N., Ohira,M., 
            Miyajima,N., Tanaka,A., Kotani,H., Nomura,N. and Ohara,O. 
  TITLE     Prediction of the coding sequences of unidentified human genes. 
            VIII. 78 new cDNA clones from brain which code for large proteins 
            in vitro 
  JOURNAL   DNA Res. 4 (5), 307-313 (1997) 
   PUBMED   9455477 
REFERENCE   2  (bases 1 to 7308) 
  AUTHORS   Ohara,O. 
  TITLE     Direct Submission 
  JOURNAL   Submitted (06-OCT-1997) Osamu Ohara, Kazi 
            Laboratory of DNA Technology; Yana 1532-! 
            292-0812, Japan (E-mail:cdnainfo@kazusa.or.jp, Tel:+81-438-52-3913, 
            Fax:+81-438-52-3914) 
FEATURES             Location/Qualifiers 
     source          1..7308 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /clone="HG1339" 
                     /sex="male" 
                     /tissue_type="brain" 
                     /clone_lib="pBluescriptII SK plus" 
     gene            <1..7308 
                     /gene="KIAA0407" 
     CDS             <247..6678 
                     /gene="KIAA0407" 
                     /note="Start codon is not identified" 
                     /codon_start=1 
                     /protein_id="BAA23703.2" 
                     /db_xref="GI:40788245" 
/translation="SSRPCQVTMPALGPALLQALWAGWVLTLQPLPPTAFTPNGTYLQ 
HLARDPTSGTLYLGATNFLFQLSPGLQLEATVSTGPVLDSRDCLPPVMPDECPQAQPT 
NNPNQLLLVSPGALWCGSVHQGVCEQRRLGQLEQLLLRPERPGDTQYVAANDPAVST 
VGLVAQGLAGEPLLFVGRGYTSRGVGGGIPPITTRALWPPDPQAAFSYEETAKLAVGR 
LSEYSHHFVSAFARGASAYFLFLRRDLQAQSRAFRAYVSRVCLRDQHYYSYVELPLAC 
EGGRYGLIQAAAVATSREVAHGEVLFAAFSSAAPPTVGRPPSAAAGASGASALCAFPL 
DEVDRLANRTRDACYTREGRAEDGTEVAYIEYDVNSDCAQLPVDTLDAYPCGSDHTPS 
PMASRVPLEATPILEWPGIQLTAVAVTMEDGHTIAFLGDSQGQLHRVYLGPGSDGHPY 
STQSIQQGSAVSRDLTFDGTFEHLYVMTQSTLLKVPVASCAQHLDCASCLAHRDPYCG 
WCVLLGRCSRRSECSRGQGPEQWLWSFQPELGCLQVAAMSPANISREETREVFLSVPD 
LPPLWPGESYSCHFGEHQSPALLTGSGVMCPSPDPSEAPVLPRGADYVSVSVELRFGA 
WIAKTSLSFYDCVAVTELRPSAQCQACVSSRWGCNWCVWQHLCTHKASCDAGPMVAS 
HQSPLVSPDPPARGGPSPSPPTAPKALATPAPDTLPVEPGAPSTATASDISPGASPSL 
LSPWGPWAGSGSISSPGSTGSPLHEEPSPPSPQNGPGTAVPAPTDFRPSATPEDLLAS 
PLSPSEVAAVPPADPGPEALHPTVPLDLPPATVPATTFPGAMGSVKPALDWLTREGGE 
LPEADEWTGGDAPAFSTSTLLSGDGDSAELEGPPAPLILPSSLDYQYDTPGLWELEEA 
TLGASSCPCVESVQGSTLMPVHVEREIRLLGRNLHLFQDGPGDNECVMELEGLEVWE 
ARVECEPPPDTQCHVTCQQHQLSYEALQPELRVGLFLRRAGRLRVDSAEGLHWLYDC 
SVGHGDCSRCQTAMPQYGCVWCEGERPRCVTREACGEAEAVATQCPAPLIHSVEPLTG 
PVDGGTRVTIRGSNLGQHVQDVLGMVTVAGVPCAVDAQEYEVSSSLVCITGASGEEVA 
GATAVEVPGRGRGVSEHDFAYQDPKVHSIFPARGPRAGGTRLTLNGSKLLTGRLEDIR 
VWGDQPCHLLPEQQSEQLRCETSPRPTPATLPVAVWFGATERRLQRGQFKYTLDPNI 
TSAGPTKSFLSGGREICVRGQNLDWQTPRIRVTWSRMLQPSQGLGRRRRWPETAC 
SLGPSCSSQQFEEPCHVNSSQLITCRTPALPGLPEDPWVRVEFILDNLVFDFATLNPT 
PFSYEADPTLQPLNPEDPTMPFRHKPGSVFSVEGENLDLAMSKEEWAMIGDGPCWK 
TLTRHHLYCEPPVEQPLPRHHALREAPDSLPEFTVQMGNLRFSLGHVQYDGESPGAFP 
VAAQVGLGVGTSLLALGVIIIVLMYRRKSKQALRDYKKVQIQLENLESSVRDRCKKEF 
TDLMTEMTDLTSDLLGSGIPFLDYBWYAERIFFPGHRESPLHRDLGVPESRRPTVEQG 
LGQLSNLLNSKLFLTKFIHTLESQRTFSARDRAYVASLLTVALHGKLEYFTDILRTLL 
SDLVAQYVAKNPKLMLRRTETWEKLLTNWMSICLYTFVRDSVGEPLYMLFRGIKHQV 
DKGPVDSVTGKAKYTLNDNRLLREDVEYRPLTLNALLAVGPGAGEAQGVPVKVLDCDT 
ISQAKEKMLDQLYKGVPLTQRPDPRTLDVEWRSGVAGHLILSDEDVTSEVQGLWRRLN 
TLQHYKVPDGATVALVPCLTKHVLRENQDYVPGERTPMLEDVDEGGIRPWHLVKPSDE 
PEPPRPRRGSLRGGERERAKAIPEIYLTRLLSMKGTLQKFVDDLFQVILSTSRPVPLA 
VKYFFDLLDEQAQQHGISDQDTIHIWKTNSLPLRFWINIIKWPQFVFDVQTSDNMDAV 
LLVIAQTFMDACTLADHKLGRDSPINKLLYARDIPRYKRMVERYYADIRQTVPASDQE 
MNSVLAELSWNYSGDLGARVALHELYKYINKYYDQIITALEEDGTAQKMQLGYRLQQI 
AAAVENKVTDL" 


Query Match 100.0%; Score 7308; DB 5; Length 7308; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 7308; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 


Qy 1 GCCCGGCCGACGCGGCTTTGTCTCCTTTGTTCCCGGCGGTGGCAGCGCCGCGCGGGAGGG 60 
     |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db 1 GCCCGGCCGACGCGGCTTTGTCTCCTTTGTTCCCGGCGGTGGCAGCGCCGCGCGGGAGGG 60 


Qy 61 GCGGGCAGCGGGCGCAGTTTTCCGCCCCTCGGTCTCCGGGTAACAGCTGCGGCTCCACCA 120 

Db 61 GCGGGCAGCGGGCGCAGTTTTCCGCCCCTCGGTCTCCGGGTAACAGCTGCGGCTCCACCA 120 


Qy 121 GACCCGGGGAGAGGCCGCTGCGCGCGGAGCCCGAGCCCGGAGCGGCCGACGCCCGCCTCG 180 

Db 121 GACCCGGGGAGAGGCCGCTGCGCGCGGAGCCCGAGCCCGGAGCGGCCGACGCCCGCCTCG 180 


Qy 181 GCGCGCACATCCCGCGGGGCCCGGCCGGGTGGTGACTCCCACACGGGTCATGCTGTTGTC 240 
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db 181 GCGCGCACATCCCGCGGGGCCCGGCCGGGTGGTGACTCCCACACGGGTCATGCTGTTGTC 240 


241 TCCTGATCCAGCCGGCCCTGCCAGGTGACCATGCCTGCTCTGGGCCCAGCTCTTCTCCAG 300 
    |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
241 TCCTGATCCAGCCGGCCCTGCCAGGTGACCATGCCTGCTCTGGGCCCAGCTCTTCTCCAG 300 


301 GCTCTCTGGGCCGGGTGGGTCCTCACCCTCCAGCCCCTTCCACCAACTGCATTCACTCCC 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 GCTCTCTGGGCCGGGTGGGTCCTCACCCTCCAGCCCCTTCCACCAACTGCATTCACTCCC 360 


361 AATGGCACGTATCTGCAGCACCTGGCAAGGGACCCCACCTCAGGCACCCTCTACCTGGGG 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 AATGGCACGTATCTGCAGCACCTGGCAAGGGACCCCACCTCAGGCACCCTCTACCTGGGG 420 


Qy 421 GCTACCAACTTCCTGTTCCAGCTGAGCCCTGGGCTGCAGCTGGAGGCCACAGTGTCCACC 480 
       |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
Db 421 GCTACCAACTTCCTGTTCCAGCTGAGCCCTGGGCTGCAGCTGGAGGCCACAGTGTCCACC 480 

481 GGCCCTGTGCTAGACAGCAGGGACTGCCTGCCACCTGTGATGCCTGATGAGTGCCCCCAG 540 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

481 GGCCCTGTGCTAGACAGCAGGGACTGCCTGCCACCTGTGATGCCTGATGAGTGCCCCCAG 540 

541 GCCCAGCCTACCAACAACCCGAATCAGCTGCTCCTGGTGAGCCCAGGGGCCCTGGTGGTA 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
541 GCCCAGCCTACCAACAACCCGAATCAGCTGCTCCTGGTGAGCCCAGGGGCCCTGGTGGTA 600 

601 TGCGGGAGCGTGCACCAGGGGGTCTGTGAACAGCGGCGCCTGGGGCAGCTCGAGCAGCTG 660 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

601 TGCGGGAGCGTGCACCAGGGGGTCTGTGAACAGCGGCGCCTGGGGCAGCTCGAGCAGCTG 660 

661 CTGCTGCGGCCAGAGCGGCCTGGGGACACACAATATGTGGCTGCCAATGATCCTGCGGTC 72 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 CTGCTGCGGCCAGAGCGGCCTGGGGACACACAATATGTGGCTGCCAATGATCCTGCGGTC 720 

721 AGCACGGTGGGGCTGGTAGCCCAGGGCTTGGCAGGGGAGCCCCTCCTGTTTGTGGGGCGA 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 AGCACGGTGGGGCTGGTAGCCCAGGGCTTGGCAGGGGAGCCCCTCCTGTTTGTGGGGCGA 780 

781 GGATACACCAGCAGGGGTGTGGGGGGTGGCATTCCACCCATCACAACCCGGGCCCTGTGG 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
781 GGATACACCAGCAGGGGTGTGGGGGGTGGCATTCCACCCATCACAACCCGGGCCCTGTGG 840 

841 CCGCCCGACCCCCAAGCTGCCTTCTCCTATGAGGAGACAGCCAAGCTGGCAGTGGGCCGC 900 

841 CCGCCCGACCCCCAAGCTGCCTTCTCCTATGAGGAGACAGCCAAGCTGGCAGTGGGCCGC 900 

901 CTCTCCGAGTACAGCCACCACTTCGTGAGTGCCTTTGCACGTGGGGCCAGCGCCTACTTC 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
901 CTCTCCGAGTACAGCCACCACTTCGTGAGTGCCTTTGCACGTGGGGCCAGCGCCTACTTC 960 

961 CTGTTCCTGCGGCGGGACCTGCAGGCTCAGTCTAGAGCTTTTCGTGCCTATGTATCTCGA 1020 

961 CTGTTCCTGCGGCGGGACCTGCAGGCTCAGTCTAGAGCTTTTCGTGCCTATGTATCTCGA 1020 

1021 GTGTGTCTCCGGGACCAGCACTACTACTCCTATGTGGAGTTGCCTCTGGCCTGCGAAGGT 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1021 GTGTGTCTCCGGGACCAGCACTACTACTCCTATGTGGAGTTGCCTCTGGCCTGCGAAGGT 1080 

1081 GGCCGCTACGGGCTGATCCAGGCTGCAGCTGTGGCCACGTCCAGGGAGGTGGCGCATGGG 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1081 GGCCGCTACGGGCTGATCCAGGCTGCAGCTGTGGCCACGTCCAGGGAGGTGGCGCATGGG 114 0 

1141 GAGGTGCTCTTTGCAGCTTTCTCCTCGGCTGCACCCCCCACTGTGGGCCGGCCCCCATCG 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1141 GAGGTGCTCTTTGCAGCTTTCTCCTCGGCTGCACCCCCCACTGTGGGCCGGCCCCCATCG 1200 

1201 GCGGCTGCTGGGGCATCTGGAGCCTCTGCCCTCTGTGCCTTCCCCCTGGATGAGGTGGAC 12 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1201 GCGGCTGCTGGGGCATCTGGAGCCTCTGCCCTCTGTGCCTTCCCCCTGGATGAGGTGGAC 12 60 

12 61 CGGCTTGCTAATCGCACGCGAGATGCCTGCTACACCCGGGAGGGTCGTGCTGAGGATGGG 1320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
12 61 CGGCTTGCTAATCGCACGCGAGATGCCTGCTACACCCGGGAGGGTCGTGCTGAGGATGGG 1320 

1321 ACCGAGGTGGCCTACATCGAGTATGATGTCAATTCTGACTGTGCACAGCTGCCAGTGGAC 1380 
     |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
1321 ACCGAGGTGGCCTACATCGAGTATGATGTCAATTCTGACTGTGCACAGCTGCCAGTGGAC 1380 

1381 ACCCTGGATGCTTATCCCTGTGGCTCAGACCACACGCCCAGCCCCATGGCCAGCCGGGTC 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1381 ACCCTGGATGCTTATCCCTGTGGCTCAGACCACACGCCCAGCCCCATGGCCAGCCGGGTC 1440 

1441 CCGCTGGAAGCCACACCAATTCTGGAGTGGCCAGGGATTCAGCTAACAGCTGTGGCAGTC 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1441 CCGCTGGAAGCCACACCAATTCTGGAGTGGCCAGGGATTCAGCTAACAGCTGTGGCAGTC 1500 

1501 ACCATGGAAGATGGACACACCATCGCTTTCCTGGGTGATAGTCAAGGGCAGCTGCACAGG 1560 

I I I I I I I I I I I I I I I 1 I 1 1 1 1 1 I I I 11 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1501 ACCATGGAAGATGGACACACCATCGCTTTCCTGGGTGATAGTCAAGGGCAGCTGCACAGG 15 60 

15 61 GTCTACTTGGGCCCAGGGAGCGATGGCCACCCATACTCCACACAGAGCATCCAGCAGGGG 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1561 GTCTACTTGGGCCCAGGGAGCGATGGCCACCCATACTCCACACAGAGCATCCAGCAGGGG 1620 

1621 TCTGCAGTGAGCAGAGACCTCACCTTTGATGGGACCTTTGAGCACCTGTATGTCATGACC 1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1621 TCTGCAGTGAGCAGAGACCTCACCTTTGATGGGACCTTTGAGCACCTGTATGTCATGACC 1680 

1681 CAGAGCACACTTCTGAAGGTTCCTGTGGCTTCCTGTGCTCAGCACCTGGACTGTGCATCT 1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1681 CAGAGCACACTTCTGAAGGTTCCTGTGGCTTCCTGTGCTCAGCACCTGGACTGTGCATCT 1740 

17 41 TGCCTTGCTCACAGGGACCCATACTGTGGGTGGTGCGTGCTCCTTGGCAGGTGCAGTCGC 1800 

17 41 TGCCTTGCTCACAGGGACCCATACTGTGGGTGGTGCGTGCTCCTTGGCAGGTGCAGTCGC 1800 

1801 CGTTCTGAGTGCTCGAGGGGCCAGGGCCCAGAGCAGTGGCTATGGAGCTTCCAGCCTGAG 1860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1801 CGTTCTGAGTGCTCGAGGGGCCAGGGCCCAGAGCAGTGGCTATGGAGCTTCCAGCCTGAG 1860 

1861 CTGGGCTGTCTGCAAGTGGCAGCCATGAGTCCTGCCAACATCAGCCGAGAGGAGACGAGG 1920 

1861 CTGGGCTGTCTGCAAGTGGCAGCCATGAGTCCTGCCAACATCAGCCGAGAGGAGACGAGG 1920 

1921 GAGGTTTTCCTATCAGTGCCAGACCTGCCACCCCTGTGGCCAGGGGAGTCATATTCCTGC 1980 

1921 GAGGTTTTCCTATCAGTGCCAGACCTGCCACCCCTGTGGCCAGGGGAGTCATATTCCTGC 1980 

1981 CACTTTGGGGAACATCAGAGTCCTGCCCTGCTGACTGGTTCTGGTGTGATGTGCCCCTCC 2040 

1981 CACTTTGGGGAACATCAGAGTCCTGCCCTGCTGACTGGTTCTGGTGTGATGTGCCCCTCC 204 0 

2041 CCAGACCCTAGTGAGGCCCCAGTGCTGCCGAGAGGAGCCGACTACGTATCCGTGAGCGTG 2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2041 CCAGACCCTAGTGAGGCCCCAGTGCTGCCGAGAGGAGCCGACTACGTATCCGTGAGCGTG 2100 

2101 GAGCTCAGATTTGGCGCTGTTGTGATCGCCAAAACTTCCCTCTCTTTCTATGACTGTGTG 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2101 GAGCTCAGATTTGGCGCTGTTGTGATCGCCAAAACTTCCCTCTCTTTCTATGACTGTGTG 2160 

2161 GCGGTCACTGAACTCCGCCCATCTGCGCAGTGCCAGGCCTGTGTGAGCAGCCGCTGGGGG 2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2161 GCGGTCACTGAACTCCGCCCATCTGCGCAGTGCCAGGCCTGTGTGAGCAGCCGCTGGGGG 2220 

2221 TGTAACTGGTGTGTCTGGCAGCACCTGTGCACCCACAAGGCCTCGTGTGATGCTGGGCCC 2280 
     |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
2221 TGTAACTGGTGTGTCTGGCAGCACCTGTGCACCCACAAGGCCTCGTGTGATGCTGGGCCC 2280 

2281 ATGGTTGCAAGCCATCAGAGCCCGCTTGTCTCCCCAGACCCTCCTGCAAGAGGTGGACCC 2340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2281 ATGGTTGCAAGCCATCAGAGCCCGCTTGTCTCCCCAGACCCTCCTGCAAGAGGTGGACCC 2340 

2341 AGCCCCTCCCCACCCACAGCCCCCAAAGCCCTGGCCACCCCTGCTCCTGACACCCTTCCC 2400 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2341 AGCCCCTCCCCACCCACAGCCCCCAAAGCCCTGGCCACCCCTGCTCCTGACACCCTTCCC 2400 

2401 GTGGAGCCTGGGGCTCCCTCCACAGCCACAGCTTCGGACATCTCACCTGGGGCTAGTCCT 24 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

24 01 GTGGAGCCTGGGGCTCCCTCCACAGCCACAGCTTCGGACATCTCACCTGGGGCTAGTCCT 24 60 

24 61 TCCCTGCTCAGCCCCTGGGGGCCATGGGCAGGTTCTGGCTCCATATCTTCCCCTGGCTCC 2520 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
24 61 TCCCTGCTCAGCCCCTGGGGGCCATGGGCAGGTTCTGGCTCCATATCTTCCCCTGGCTCC 2520 

2521 ACAGGGTCGCCTCTCCATGAGGAGCCCTCCCCTCCCAGCCCCCAAAATGGACCTGGAACC 2580 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2521 ACAGGGTCGCCTCTCCATGAGGAGCCCTCCCCTCCCAGCCCCCAAAATGGACCTGGAACC 2580 

2581 GCTGTCCCTGCCCCCACTGACTTCAGACCCTCAGCCACACCTGAGGACCTCTTGGCCTCC 2640 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2581 GCTGTCCCTGCCCCCACTGACTTCAGACCCTCAGCCACACCTGAGGACCTCTTGGCCTCC 2640 

2 641 CCGCTGTCACCGTCAGAGGTAGCAGCAGTGCCCCCTGCAGACCCTGGCCCCGAGGCTCTT 27 00 

2641 CCGCTGTCACCGTCAGAGGTAGCAGCAGTGCCCCCTGCAGACCCTGGCCCCGAGGCTCTT 2700 

2701 CATCCCACAGTGCCCCTGGACCTGCCCCCTGCCACTGTTCCTGCCACCACTTTCCCAGGG 27 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2701 CATCCCACAGTGCCCCTGGACCTGCCCCCTGCCACTGTTCCTGCCACCACTTTCCCAGGG 27 60 

27 61 GCCATGGGCTCCGTGAAGCCCGCCCTGGACTGGCTCACGAGAGAAGGCGGCGAGCTGCCC 2820 

27 61 GCCATGGGCTCCGTGAAGCCCGCCCTGGACTGGCTCACGAGAGAAGGCGGCGAGCTGCCC 2820 

2821 GAGGCGGACGAGTGGACGGGGGGTGACGCACCCGCCTTCTCCACTTCCACCCTCCTCTCA 2880 

2821 GAGGCGGACGAGTGGACGGGGGGTGACGCACCCGCCTTCTCCACTTCCACCCTCCTCTCA 2880 

2881 GGTGATGGAGACTCAGCAGAGCTTGAGGGCCCTCCCGCCCCCCTCATCCTCCCGTCCAGC 2 940 

2881 GGTGATGGAGACTCAGCAGAGCTTGAGGGCCCTCCCGCCCCCCTCATCCTCCCGTCCAGC 2 94 0 

2941 CTCGACTACCAGTATGACACCCCCGGGCTCTGGGAGCTGGAAGAGGCGACCTTGGGGGCA 3000 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2941 CTCGACTACCAGTATGACACCCCCGGGCTCTGGGAGCTGGAAGAGGCGACCTTGGGGGCA 3000 

3001 AGCTCCTGCCCCTGTGTGGAGAGCGTTCAGGGCTCCACGTTGATGCCGGTCCATGTGGAG 3060 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3001 AGCTCCTGCCCCTGTGTGGAGAGCGTTCAGGGCTCCACGTTGATGCCGGTCCATGTGGAG 3060 

3061 CGGGAAATCCGGCTGCTAGGCAGGAACCTGCACCTTTTCCAGGATGGCCCAGGAGACAAT 3120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3061 CGGGAAATCCGGCTGCTAGGCAGGAACCTGCACCTTTTCCAGGATGGCCCAGGAGACAAT 3120 

3121 GAGTGTGTGATGGAGCTGGAGGGCCTCGAGGTGGTGGTTGAGGCCCGGGTCGAGTGTGAG 3180 
     |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
3121 GAGTGTGTGATGGAGCTGGAGGGCCTCGAGGTGGTGGTTGAGGCCCGGGTCGAGTGTGAG 3180 

3181 CCACCTCCAGATACCCAGTGCCATGTCACCTGCCAGCAGCACCAGCTCAGCTATGAGGCT 3240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3181 CCACCTCCAGATACCCAGTGCCATGTCACCTGCCAGCAGCACCAGCTCAGCTATGAGGCT 3240 

3241 CTGCAGCCGGAGCTCCGTGTGGGGCTGTTTCTGCGTCGGGCCGGCCGTCTGCGTGTGGAC 3300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3241 CTGCAGCCGGAGCTCCGTGTGGGGCTGTTTCTGCGTCGGGCCGGCCGTCTGCGTGTGGAC 3300 

3301 AGTGCTGAGGGGCTGCATGTGGTACTGTATGACTGTTCCGTGGGACATGGAGACTGCAGC 3360 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

3301 AGTGCTGAGGGGCTGCATGTGGTACTGTATGACTGTTCCGTGGGACATGGAGACTGCAGC 33 60 

33 61 CGCTGCCAAACTGCCATGCCCCAGTATGGCTGTGTGTGGTGTGAGGGGGAGCGTCCACGT 3420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3361 CGCTGCCAAACTGCCATGCCCCAGTATGGCTGTGTGTGGTGTGAGGGGGAGCGTCCACGT 3420 

3421 TGTGTGACCCGGGAGGCCTGTGGTGAGGCTGAGGCTGTGGCCACCCAGTGCCCAGCGCCC 3480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3421 TGTGTGACCCGGGAGGCCTGTGGTGAGGCTGAGGCTGTGGCCACCCAGTGCCCAGCGCCC 3480 

3481 CTCATCCACTCGGTGGAGCCACTGACTGGGCCTGTAGACGGAGGCACCCGTGTCACCATC 3540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3481 CTCATCCACTCGGTGGAGCCACTGACTGGGCCTGTAGACGGAGGCACCCGTGTCACCATC 3540 

3541 AGGGGCTCCAACCTGGGCCAGCATGTGCAGGATGTGCTGGGCATGGTCACGGTGGCTGGA 3600 

3541 AGGGGCTCCAACCTGGGCCAGCATGTGCAGGATGTGCTGGGCATGGTCACGGTGGCTGGA 3600 

3601 GTGCCCTGTGCTGTGGATGCCCAGGAGTACGAGGTCTCCAGCAGCCTCGTGTGCATCACC 3660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3601 GTGCCCTGTGCTGTGGATGCCCAGGAGTACGAGGTCTCCAGCAGCCTCGTGTGCATCACC 3660 

3661 GGGGCCAGTGGGGAGGAGGTGGCCGGCGCCACAGCGGTGGAGGTGCCGGGAAGAGGACGT 3720 

3661 GGGGCCAGTGGGGAGGAGGTGGCCGGCGCCACAGCGGTGGAGGTGCCGGGAAGAGGACGT 3720 

3721 GGTGTCTCAGAACACGACTTTGCCTACCAGGATCCGAAGGTCCATTCCATCTTCCCGGCC 3780 

3721 GGTGTCTCAGAACACGACTTTGCCTACCAGGATCCGAAGGTCCATTCCATCTTCCCGGCC 37 80 

37 81 CGCGGCCCCAGAGCTGGGGGCACCCGTCTCACCCTGAATGGCTCCAAGCTCCTGACTGGG 384 0 

37 81 CGCGGCCCCAGAGCTGGGGGCACCCGTCTCACCCTGAATGGCTCCAAGCTCCTGACTGGG 384 0 

3841 CGGCTGGAGGACATCCGAGTGGTGGTTGGAGACCAGCCTTGTCACTTGCTGCCGGAGCAG 3900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3841 CGGCTGGAGGACATCCGAGTGGTGGTTGGAGACCAGCCTTGTCACTTGCTGCCGGAGCAG 3900 

3901 CAGTCAGAACAACTGCGGTGTGAGACCAGCCCACGCCCCACGCCTGCCACGCTCCCTGTG 3960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3901 CAGTCAGAACAACTGCGGTGTGAGACCAGCCCACGCCCCACGCCTGCCACGCTCCCTGTG 3960 

3961 GCTGTGTGGTTTGGGGCCACGGAGCGGAGGCTTCAACGCGGACAGTTCAAGTATACCTTG 4020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
3961 GCTGTGTGGTTTGGGGCCACGGAGCGGAGGCTTCAACGCGGACAGTTCAAGTATACCTTG 4020 

4021 GACCCCAACATCACCTCTGCTGGCCCCACCAAGAGCTTCCTCAGTGGAGGACGTGAGATA 4080 
     |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
4021 GACCCCAACATCACCTCTGCTGGCCCCACCAAGAGCTTCCTCAGTGGAGGACGTGAGATA 4080 

4081 TGCGTCCGTGGCCAGAATCTGGACGTGGTACAGACGCCAAGAATCCGGGTGACCGTGGTC 4140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4081 TGCGTCCGTGGCCAGAATCTGGACGTGGTACAGACGCCAAGAATCCGGGTGACCGTGGTC 4140 

4141 TCGAGAATGCTGCAGCCCAGCCAGGGGCTTGGACGGAGGCGTCGCGTGGTCCCGGAGACG 4200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4141 TCGAGAATGCTGCAGCCCAGCCAGGGGCTTGGACGGAGGCGTCGCGTGGTCCCGGAGACG 4200 

4201 GCATGTTCCCTTGGACCCTCCTGCAGTAGCCAGCAATTTGAGGAGCCGTGCCATGTCAAC 42 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

42 01 GCATGTTCCCTTGGACCCTCCTGCAGTAGCCAGCAATTTGAGGAGCCGTGCCATGTCAAC 42 60 

42 61 TCCTCCCAGCTCATCACGTGCCGCACACCTGCCCTCCCAGGCCTGCCTGAGGACCCCTGG 4320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
42 61 TCCTCCCAGCTCATCACGTGCCGCACACCTGCCCTCCCAGGCCTGCCTGAGGACCCCTGG 4320 

4321 GTCCGGGTGGAATTTATCCTTGACAACCTGGTCTTTGACTTTGCAACACTGAACCCCACA 4380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4321 GTCCGGGTGGAATTTATCCTTGACAACCTGGTCTTTGACTTTGCAACACTGAACCCCACA 4380 

4381 CCTTTCTCCTATGAGGCCGACCCCACCCTGCAGCCACTCAACCCTGAGGACCCCACCATG 4440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4381 CCTTTCTCCTATGAGGCCGACCCCACCCTGCAGCCACTCAACCCTGAGGACCCCACCATG 4440 

4 4 41 CCATTCCGGCACAAGCCTGGGAGTGTGTTCTCCGTGGAGGGGGAGAACCTGGACCTTGCA 4500 

44 41 CCATTCCGGCACAAGCCTGGGAGTGTGTTCTCCGTGGAGGGGGAGAACCTGGACCTTGCA 4500 

4501 ATGTCCAAGGAGGAGGTGGTGGCTATGATAGGGGATGGCCCCTGTGTGGTGAAGACGCTG 4560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4501 ATGTCCAAGGAGGAGGTGGTGGCTATGATAGGGGATGGCCCCTGTGTGGTGAAGACGCTG 4560 

4561 ACGCGGCACCACCTGTACTGCGAGCCCCCCGTGGAGCAGCCCCTGCCACGGCACCATGCC 4620 

4561 ACGCGGCACCACCTGTACTGCGAGCCCCCCGTGGAGCAGCCCCTGCCACGGCACCATGCC 4620 

4621 CTCCGAGAGGCACCTGACTCTTTGCCTGAGTTCACGGTGCAGATGGGGAACTTGCGCTTC 4680 

4 621 CTCCGAGAGGCACCTGACTCTTTGCCTGAGTTCACGGTGCAGATGGGGAACTTGCGCTTC 4 680 

4 681 TCCCTGGGTCACGTGCAGTATGACGGCGAGAGCCCTGGGGCTTTTCCTGTGGCAGCCCAG 47 40 

4 681 TCCCTGGGTCACGTGCAGTATGACGGCGAGAGCCCTGGGGCTTTTCCTGTGGCAGCCCAG 4 7 4 0 

47 41 GTGGGCTTGGGGGTGGGCACCTCTCTTCTGGCTCTGGGTGTCATCATCATTGTCCTCATG 4800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4741 GTGGGCTTGGGGGTGGGCACCTCTCTTCTGGCTCTGGGTGTCATCATCATTGTCCTCATG 4800 

4801 TACAGGAGGAAGAGCAAGCAGGCCCTGAGGGACTATAAGAAGGTTCAGATCCAGCTGGAG 4860 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4801 TACAGGAGGAAGAGCAAGCAGGCCCTGAGGGACTATAAGAAGGTTCAGATCCAGCTGGAG 4860 

4861 AATCTGGAGAGCAGTGTGCGGGACCGCTGCAAGAAGGAATTCACAGACCTCATGACTGAG 4920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
4861 AATCTGGAGAGCAGTGTGCGGGACCGCTGCAAGAAGGAATTCACAGACCTCATGACTGAG 4920 

4921 ATGACCGATCTCACCAGTGACCTCCTGGGCAGCGGCATCCCCTTCCTCGACTACAAGGTG 4980 
     |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| 
4921 ATGACCGATCTCACCAGTGACCTCCTGGGCAGCGGCATCCCCTTCCTCGACTACAAGGTG 4980 

4981 TATGCGGAGAGGATCTTCTTCCCTGGGCACCGCGAGTCGCCCTTGCACCGGGACCTGGGT 5040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4981 TATGCGGAGAGGATCTTCTTCCCTGGGCACCGCGAGTCGCCCTTGCACCGGGACCTGGGT 5040 

5041 GTGCCTGAGAGCAGACGGCCCACTGTGGAGCAAGGGCTGGGGCAGCTCTCTAACCTGCTC 5100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5041 GTGCCTGAGAGCAGACGGCCCACTGTGGAGCAAGGGCTGGGGCAGCTCTCTAACCTGCTC 5100 

5101 AACAGCAAGCTCTTCCTCACCAAGTTCATCCACACGCTGGAGAGCCAGCGCACCTTTTCA 5160 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5101 AACAGCAAGCTCTTCCTCACCAAGTTCATCCACACGCTGGAGAGCCAGCGCACCTTTTCA 5160 

5161 GCTCGGGACCGTGCCTACGTGGCATCTCTGCTCACCGTGGCACTGCATGGGAAGCTTGAG 5220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5161 GCTCGGGACCGTGCCTACGTGGCATCTCTGCTCACCGTGGCACTGCATGGGAAGCTTGAG 5220 

5221 TATTTCACTGACATCCTCCGCACTCTGCTCAGTGACCTGGTTGCCCAGTATGTGGCCAAG 5280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5221 TATTTCACTGACATCCTCCGCACTCTGCTCAGTGACCTGGTTGCCCAGTATGTGGCCAAG 5280 

5281 AACCCCAAGCTGATGCTGCGCAGGACAGAGACTGTGGTGGAGAAGCTGCTCACCAACTGG 5340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5281 AACCCCAAGCTGATGCTGCGCAGGACAGAGACTGTGGTGGAGAAGCTGCTCACCAACTGG 5340 

5341 ATGTCCATCTGTCTGTATACCTTCGTGAGGGACTCCGTAGGGGAGCCTCTGTACATGCTC 5400 

5341 ATGTCCATCTGTCTGTATACCTTCGTGAGGGACTCCGTAGGGGAGCCTCTGTACATGCTC 5400 

5401 TTTCGAGGGATTAAGCACCAAGTGGATAAGGGGCCAGTGGACAGTGTGACAGGCAAGGCC 5460 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5401 TTTCGAGGGATTAAGCACCAAGTGGATAAGGGGCCAGTGGACAGTGTGACAGGCAAGGCC 5460 

5461 AAATACACCTTGAACGACAACCGCCTGCTCAGAGAGGATGTGGAGTACCGTCCCCTGACC 5520 

5461 AAATACACCTTGAACGACAACCGCCTGCTCAGAGAGGATGTGGAGTACCGTCCCCTGACC 5520 

5521 TTGAATGCACTATTGGCTGTGGGGCCTGGGGCAGGAGAGGCCCAGGGCGTGCCCGTGAAG 5580 

5521 TTGAATGCACTATTGGCTGTGGGGCCTGGGGCAGGAGAGGCCCAGGGCGTGCCCGTGAAG 5580 

5581 GTCCTAGACTGTGACACCATCTCCCAGGCAAAGGAGAAGATGCTGGACCAGCTTTATAAA 5640 

5581 GTCCTAGACTGTGACACCATCTCCCAGGCAAAGGAGAAGATGCTGGACCAGCTTTATAAA 5640 

5641 GGAGTGCCTCTCACCCAGCGGCCAGACCCTCGCACCCTTGATGTTGAGTGGCGGTCTGGG 5700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5641 GGAGTGCCTCTCACCCAGCGGCCAGACCCTCGCACCCTTGATGTTGAGTGGCGGTCTGGG 5700 

5701 GTGGCCGGGCACCTCATTCTTTCTGACGAGGATGTCACTTCTGAGGTCCAGGGTCTGTGG 5760 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5701 GTGGCCGGGCACCTCATTCTTTCTGACGAGGATGTCACTTCTGAGGTCCAGGGTCTGTGG 5760 

5761 AGGCGCCTGAACACACTGCAGCATTACAAGGTCCCAGATGGAGCAACTGTGGCCCTCGTC 5820 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5761 AGGCGCCTGAACACACTGCAGCATTACAAGGTCCCAGATGGAGCAACTGTGGCCCTCGTC 5820 

5821 CCCTGCCTCACCAAGCATGTGCTCCGGGAAAACCAGGATTATGTCCCTGGAGAGCGGACC 5880 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
5821 CCCTGCCTCACCAAGCATGTGCTCCGGGAAAACCAGGATTATGTCCCTGGAGAGCGGACC 5880 

5881 CCAATGCTGGAGGATGTAGATGAGGGGGGCATCCGGCCCTGGCACCTGGTGAAGCCAAGT 5940 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5881 CCAATGCTGGAGGATGTAGATGAGGGGGGCATCCGGCCCTGGCACCTGGTGAAGCCAAGT 5940 

5941 GATGAGCCGGAGCCGCCCAGGCCTCGGAGGGGCAGCCTTCGGGGCGGGGAGCGTGAGCGC 6000 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5941 GATGAGCCGGAGCCGCCCAGGCCTCGGAGGGGCAGCCTTCGGGGCGGGGAGCGTGAGCGC 6000 

6001 GCCAAGGCCATCCCTGAGATCTACCTGACCCGCCTGCTGTCCATGAAGGGCACCCTGCAG 6060 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6001 GCCAAGGCCATCCCTGAGATCTACCTGACCCGCCTGCTGTCCATGAAGGGCACCCTGCAG 6060 

6061 AAGTTCGTGGATGACCTGTTCCAGGTGATTCTCAGCACCAGCCGCCCCGTGCCGCTCGCT 6120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6061 AAGTTCGTGGATGACCTGTTCCAGGTGATTCTCAGCACCAGCCGCCCCGTGCCGCTCGCT 6120 

6121 GTGAAGTACTTCTTTGACCTGCTGGATGAGCAGGCCCAGCAGCATGGCATCTCCGACCAG 6180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6121 GTGAAGTACTTCTTTGACCTGCTGGATGAGCAGGCCCAGCAGCATGGCATCTCCGACCAG 6180 

6181 GACACCATCCACATCTGGAAGACCAACAGCTTGCCTCTGAGGTTCTGGATCAATATAATA 6240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6181 GACACCATCCACATCTGGAAGACCAACAGCTTGCCTCTGAGGTTCTGGATCAATATAATA 6240 

6241 AAAAACCCGCAGTTTGTGTTCGACGTGCAAACATCTGATAACATGGATGCGGTGCTCCTT 6300 

6241 AAAAACCCGCAGTTTGTGTTCGACGTGCAAACATCTGATAACATGGATGCGGTGCTCCTT 6300 

6301 GTCATTGCACAGACCTTCATGGACGCCTGCACCCTGGCCGACCACAAGCTGGGCCGGGAC 6360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6301 GTCATTGCACAGACCTTCATGGACGCCTGCACCCTGGCCGACCACAAGCTGGGCCGGGAC 6360 

6361 TCCCCGATCAACAAACTTCTGTATGCACGGGACATTCCCCGGTACAAGCGGATGGTGGAA 6420 

6361 TCCCCGATCAACAAACTTCTGTATGCACGGGACATTCCCCGGTACAAGCGGATGGTGGAA 6420 

6421 AGGTACTATGCAGACATCAGACAGACTGTCCCAGCCAGCGACCAAGAGATGAACTCTGTC 6480 

6421 AGGTACTATGCAGACATCAGACAGACTGTCCCAGCCAGCGACCAAGAGATGAACTCTGTC 6480 

6481 CTGGCTGAACTGTCCTGGAACTACTCCGGAGACCTCGGGGCGCGAGTGGCCCTGCATGAA 6540 

6481 CTGGCTGAACTGTCCTGGAACTACTCCGGAGACCTCGGGGCGCGAGTGGCCCTGCATGAA 6540 

6541 CTCTACAAGTACATCAACAAGTACTATGACCAGATCATCACTGCCCTGGAGGAGGATGGC 6600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6541 CTCTACAAGTACATCAACAAGTACTATGACCAGATCATCACTGCCCTGGAGGAGGATGGC 6600 

6601 ACGGCCCAGAAGATGCAGCTGGGCTATCGGCTCCAGCAGATTGCAGCTGCTGTGGAAAAC 6660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6601 ACGGCCCAGAAGATGCAGCTGGGCTATCGGCTCCAGCAGATTGCAGCTGCTGTGGAAAAC 6660 

6661 AAGGTCACAGATCTATAGGAACCCAGGAGCCACGGCCTGCTGTTGCTTCAGCCTGGCCTG 6720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6661 AAGGTCACAGATCTATAGGAACCCAGGAGCCACGGCCTGCTGTTGCTTCAGCCTGGCCTG 6720 

6721 GGCAGCCCTGGAAGCTCGGAGGAGAGGCCACCTTCTTAGGTGCCTGTAGTGACTGACAAG 6780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6721 GGCAGCCCTGGAAGCTCGGAGGAGAGGCCACCTTCTTAGGTGCCTGTAGTGACTGACAAG 6780 

6781 CAGAGTTAGTGGAAGGTGACTCCCAGTCTCCTGGTGGCTCTGGCCTCGGCCCTGCTGGAT 6840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6781 CAGAGTTAGTGGAAGGTGACTCCCAGTCTCCTGGTGGCTCTGGCCTCGGCCCTGCTGGAT 6840 

6841 CCACCTCCTAGACCCGGGGCCTCAAGGCTCATGGGGTAGTACCCAGCCTGCTCCCCGAGT 6900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6841 CCACCTCCTAGACCCGGGGCCTCAAGGCTCATGGGGTAGTACCCAGCCTGCTCCCCGAGT 6900 

6901 CCAGCGACCCTGTGACACCGGTCTGCAGGGAGTTGGGGACTAAGGGCTTCCAGAGAGTGG 6960 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6901 CCAGCGACCCTGTGACACCGGTCTGCAGGGAGTTGGGGACTAAGGGCTTCCAGAGAGTGG 6960 

6961 CTGGAAGAGACTCCAGGCCCCTGGGGAGACTGTACTGTTCCTGAACACTGGCCTTGGCCA 7020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
6961 CTGGAAGAGACTCCAGGCCCCTGGGGAGACTGTACTGTTCCTGAACACTGGCCTTGGCCA 7020 

7021 CACTGGGATTCGGAGAGGAAGGAGGAGAGCCCCATGCTTCCTGTCTGCCTCCTCCACCAT 7080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7021 CACTGGGATTCGGAGAGGAAGGAGGAGAGCCCCATGCTTCCTGTCTGCCTCCTCCACCAT 7080 

7081 CCCTGACCTCAGTTGAGCTGCCTCTGGCCTTGTTGCTGCTGCCACATCCTAGGTCTAAGA 7140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7081 CCCTGACCTCAGTTGAGCTGCCTCTGGCCTTGTTGCTGCTGCCACATCCTAGGTCTAAGA 7140 

7141 GTTGAACGCCTCTCCTAGGCCACTACAAACTGACCCCTCAGCAGGGCTGGCTGCCACAGG 7200 

7141 GTTGAACGCCTCTCCTAGGCCACTACAAACTGACCCCTCAGCAGGGCTGGCTGCCACAGG 7200 


7201 GCTGCCCTGCCTCATAGGTAGCCATGGTGAGGGCTATCTGCTGCAGGGGGGTCTTGGGGA 7260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7201 GCTGCCCTGCCTCATAGGTAGCCATGGTGAGGGCTATCTGCTGCAGGGGGGTCTTGGGGA 7260 

7261 GAGTGGTGACTCCATTGACCCAGCTTTTCATTAAAGGATAACACACTG 7308 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
7261 GAGTGGTGACTCCATTGACCCAGCTTTTCATTAAAGGATAACACACTG 7308 
Query Match 12.8%; Score 935.2; DB 

Best Local Similarity 57.8%; Pred. No. 4.7e-l 
Matches 2102; Conservative 0; Mismatches 1243; 


(PEKE ) PE CORP NY. 

Venter CJ, Adams M, Li PWD, Myers EW; 

WPI; 2002-682812/73. 

New isolated nucleic acid detection reagent for detecting the 

specified human exons. 

Claim 4; SEQ ID NO 14076; 40pp; English. 

The present invention relates to a novel isolated nucleic acid detection 
reagent for detecting the presence of specified human exons. The exon 
sequences cover every identified human transcript and exon comprising 
every gene/coding region of the human genome. The present sequence is one 
such exon sequence. The nucleic acid detection agent is used for 
detecting the presence of at least 100000, at least 2000, at least 50000 
or at least 10000 human exons. The sequences that span exon-exon 
junctions eliminate false signals caused by genomic contamination. This 
is because a detection element comprising two neighboring exons as one 
contiguous sequence will not hybridize to genomic DNA comprising 
intervening intronic DNA. These detection elements will only hybridize to 
expressed mRNA transcripts in which the exons are connected and the 
intronic sequence has been removed, therefore forming one contiguous 
stretch of sequence corresponding to the sequence of the detection 
element that spans the exon-exon junction. 

Sequence 5412 BP; 945 A; 1831 C; 1688 G; 948 T; 0 U; 0 Other; 

Length 5412; 

Indels 291; Gaps 13; 
3200 GCCATGTCACCTGCCAGCAGCACCAGCTCAGCTATGAGGCTCTGCAGCCGGAGCTCCGTG 3259 
1907 GCCTCATCCACTGCCAGGCCCACCAGTTTTATCCCTCCATGTCCCAGCGGGAGCTCCCAG 1966 
3260 TGGGGCTGTTTCTGCGTCGGGCCGGCCGTCTGCGTGTGGACAGTGCTGAGGGGCTGCATG 3319 
1967 TGCCCATCTACGTCACCCAGGGTGAAGCCCAGAGGCTGGACAACACCCATGCTCTTTATG 2026 
3320 TGGTACTGTATGACTGTTCCGTGGGACATGGAGACTGCAGCCGCTGCCAAACTGCCATGC 3379 
2027 TGATCCTGTACGACTGCGCCATGGGCCACCCGGACTGCAGCCACTGCCAAGCGGCCAACA 2086 
3380 CCCAGTATGGCTGTGTGTGGTGTGAGGGGGAGCGTCCACGTTGTGTGACCCGGGAGGCCT 3439 

2087 GGAGCCTGGGCTGCCTGTGGTGTGCTGACGGCCAGCCTGCCTGT CGCTATGGGCCCT 2143 

3440 GTGGTGAGGCTGAGGCTGTGGCCACCCAGTGCCCAGCGCCCCTCATCCACTCGGTGGAGC 3499 
2144 TGTGCCCGCCGGGGGCTGTGGAGCTGCTGTGTCCTGCGCCCAGCATTGATGCAGTCGAGC 2203 
3500 CACTGACTGGGCCTGTAGACGGAGGCACCCGTGTCACCATCAGGGGCTCCAACCTGGGCC 3559 
2204 CCCTGACCGGTCCCCCTGAGGGAGGCTTGGCCCTCACCATCCTGGGCTCCAACCTGGGCC 2263 
3560 AGCATGTGCAGGATGTGCTGGGCATGGTCACGGTGGCTGGAGTGCCCTGTGCTGTGGATG 3619 
2264 GGGCCTTCGCCGATGTGCAGTACGCCGTGAGCGTGGCCAGCCGGCCCTGCAACCCTGAGC 2323 
3620 CCCAGGAGTACGAGGTCTCCAGCAGCCTCGTGTGCATCACCGGGGCCAGTGGGGAGGAGG 3679 
2324 CCTCTCTCTACCGCACGTCGGCCCGGATTGTGTGTGTGACATCTCCTGCCCCCAATGGCA 2383 
3680 TGGCCGGCGCCACAGCGGTGGAGGTGCCGGGAAGAGGACGTGGTGTCTCAGAACACGACT 3739 
2384 CCACTGGGCCCGTCCGGGTGGCCATTAAGAGCCAGCCACCAGGCATCTCAAGCCAGCACT 2443 

3740 TTGCCTACCAGGATCCGAAGGTCCATTCCATCTTCCCGGCCCGCGGCCCCAGAGCTGGGG 3799 

2444 TCACCTACCAGGACCCTGTCCTGCTGAGCCTGAGTCCTCGCTGGGGCCCCCAGGCAGGGG 2503 

3800 GCACCCGTCTCACCCTGAATGGCTCCAAGCTCCTGACTGGGCGGCTGGAGGACATCCGAG 3859 

2504 GCACCCAGCTCACCATCCGAGGTCAGCACCTCCAGACAGGTGGCA ACACCAGTG 2557 

3860 TGGTGGTTGGAGACCAGCCTTGTCACTTGCTGCCGGAGCAGCAGTCAGAACAACTGCGGT 3919 

2558 CCTTCGTGGGTGGCCAACCCTGTCCCATCCTGGAGCCAGTGTGTCCGGAGGCCATCGTGT 2617 

3920 GTGAGACCAGCCCACGCCCCACGCCTGCCACGCTCCCTGTGGCTGTGTGGTTTGGGGCCA 3979 

2i 18 G r,TA Ai",GCCCCAGGCTGCCCCAGGAGAAGCAGCGGTCCTTGTGGTCTTTGGCCATG 2677 

J 1^11 ,-.;Ai; i^i^AGGCTTCAACGCGGACAGTTCAAGTATACCTTGGACCCCAACATCACCTCTG 4039 

2-^8 A>; ACACTGCTCGCCAGCCCCTTCCGCTACACCGCCAACCCCCAGCTTGTAGCGG 2737 

4,14,1 T,;,; ACCAAGAGCTTCCTCAGTGGAGGACGTGAGATATGCGTCCGTGGCCAGAATC 4099 

21:- ,^,;A,^ AGTGCCAGCTTCCGGGGGGGTGGGCGACTGATCCGTGTCAGGGGCACCGGCC 2797 

4100 TGGACGTGGTACAGACGCCAAGAATCCGGGTG ACCGTGGTCT 4141 

2798 TAGACGTGGTGCAGCGGCCCCTACTGTCTGTGTGGCTGGAGGCTGACGCAGAGGTGCAGG 2857 

4142 CGAGAATGCTGCAGCCCAGCCAGGGGCTTGGACGGAGGCGTCGCGTGGTCCCGGAGACGG 4201 

2858 CTTCCAGGGCCCAGCCCCAGGACCCACAGCCAAGGAGGAGCTGTGGAGCCCCTGCTGCGG 2917 

4202 CATGTTCCCTTGGACCCTCCTGCAGTAGCCAGCAATTTGAGGAGCC GTGCCATG 4255 

2918 ACCCCCAGGCTTGTATCCAGCTCGGTGGGGGGCTGCTGCAGTGCTCCACCGTCTGCTCCG 2977 

4256 TCAACTCCTCCCAGCTCATCACGTGCCGCACACCTGCCCTCCCAGGCCTGCCTGAGGACC 4315 

2978 TCAACTCGTCCAGCCTCCTCCTGTGCCGGAGCCCTGCTGTACCAGACAGAGCCCACCCGC 3037 

4316 CCTGGGTCCGGGTGGAATTTATCCTTGACAACCTGGTCTTTGACTTTGCAACACTGAACC 4375 

3038 AGCGGGTCTTCTTCACCCTAGACAACGTGCAAGTGGACTTCGCCAGTGCCACTCGCGACG 3097 

4376 CCACACCTTTCTCCTA TGAGGCCGACCCCA 4405 

3098 GGCCTGCCCGCCCCTACCGCCTCAAGCCAGGCCATGTCCTGGATGTGGAGGTGAGGGCCA 3157 

4406 CCCTGCAGCCACTCAACCCTGAGGACCCCACCATGCCATTCCGGCACAAGCCTGGGAGTG 4465 

3158 CCTTCAACCCTGCCCCGCCACGGTGCTCAGGCCGCCTCTGTGGGGGCCAGCGGCTTAGGC 3217 

4466 TGTTCTCCGTG GAGGGGGAGAACCTGGACCTTGCAATGTCCAAGGAGGAGGTGGTGG 4522 

3218 TCCCATGTGTGTCCCAGGGCGAGGGCCTCAACCTGGGCATCAGCAAGGAGGAGGTGCGCG 3277 

4523 CTATGATAGGGGATGGCCCCTGTGTGGTGAAGACGCTGACGCGGCACCACCTGTACTGCG 4582 

3278 TGCACATCGGCCGCGGCGAGTGCCTGGTGAAGACGCTCACGCGCACCCACCTGTACTGCG 3337 

4583 AGCCCCCCGTGGAGCAGCCCCTGCCACGGCACCATGCCCTCCGAGAGGCACCTGACTCTT 4642 

3338 AGCCGCCTGCGCACGCCCCGCAGCCTGCCAATGGCTCCGGCC 3379 
5054 
3800 


5119 
3920 
5162 
3980 
5222 


TGCCTGAGTTCACGGTGCAGATGGGGAACTTGCGCTTCTCCCTGGGTCACGTGCAGTATG 
TGCCACAGTTCGTGGTGCAGATGGGCAATGTGCAGCTGGCCCTGGGCCCTGTGCAGTACG 

ACGGCGAGAGCCCTGGG GCTTTTrr'TGTGGrAGGrCAGGTGGGCTTGGGGGTGGGCA 

AGGCTGAACCCCCGCTGTCTGCCT' iVGGCAGGCGTGGGCATGGGTG 
CCTCTCTTCTGGCTCTGGGTGTCATCATCATTGTCCTCATGTACAGGAGGAAGAGCAAGC 
CTGCAGTGCTGATTGCCGCCGTGCTCCTCCTCACCCTCATGTACAGGCACAAGAGCAAGC 
A--,--, T -,A TATAAGAAGGTTCAGATCCAGCTGGAGAATCTGGAGAGCAGTGTGC 

^CAGAAGGTGCTAGTGCAGCTGGAGAGCCTGGAGACCGGCGTGG 
\ ZVATT AAA T ATI A( T( A( AT A AT T A AlTG 

GA^-V ' a A AGCG 

Ar ^T T", A AT TT( ( T( bA( TA( AAGGTI TAT A A ATCTTCT 

ACCTGGAGGGCAGCGGGATCCCCTTCCTGGACTACCGCACCTACGCCGAGCGCGCCTTCT 

TCCCTGGGCACCGCGAGTCGCCCTTGCACCGGGACCTGGGTGTGCC TGAGAGCA 

TCCCTGGCCATGGCGGTTGCCCGCTGCAGCCCAAGCCTGAGGGGCCAGGGGAGGACGGCC 
GACGGCCCACTGTGGAGCAAGGGCTGGGGCAGCTCTCTAACCTGCTCAACAGCAAGCTCT 
ACTGTGCCACTGTGCGCCAGGGCCTCACGCAGCTCTCCAACCTGCTCAACAGCAAGCTCT 

1 TCCTC 

) TCCTCCTCACGGTGAGGGCCGTGTGGCGGGAGTGCCCAGTGGGCAAGGAGGTGGGGCTGG 

ACCAAGTTCATCCACACGCTGGAGAGCCAGCGCACCTTTTCAG 

GGAACTACTGGCCTGAGACAAAGCTCATCCACACCCTGGAGGAGCAGCCCAGCTTTTCCC 
CTCGGGACCGTGCCTACGTGGCATCTCTGCTCACCGTGGCACTGCATGGGAAGCTTGAGT 
AGAGGGATCGCTGCCATGTGGCTTCGCTGCTGTCGCTAGCGCTACACGGCAAGCTGGAGT < 
ATTTCACTGACATCCTCCGCACTCTGCTCAGTGACCTGGTTGCCCAGTATGTGGCCAAGA J 
D ACCTGACGGACATCATGAGGACCCTGCTGGGTGACCTGGCGGCCCATTACGTGCACAGGA < 

2 ACCCCAAGCTGATGCTGCGCAGGACAGAGACTGTGGTGGAGAAGCTGCTCACCAACTGGA J 

3 ACCCCAAGCTCATGCTACGCAGGACAGAGACCATGGTGGAGAAACTGCTCACCAACTGGC < 

2 TGTCCATCTGTCTGTATACCTTCGTGAGGGACTCCGTAGGGGAGCCTCTGTACATGCTCT J 

3 TGTCCATCTGCCTGTACGCCTTCCTGAGGGAGGTGGCTGGTGAACCACTGTACATGCTCT 4 

2 TTCGAGGGATTAAGCACCAAGTGGATAAGGGGCCAGTGGACAGTGTGACAGGCAAGGCCA f 

3 TCCGGGCCATCCAGTACCAGGTGGACAAAGGCCCCGTGGACGCCGTGACAGGCAAGGCCA < 

2 AATACACCTTGAACGACAACCGCCTGCTCAGAGAGGATGTGGAGTACCGTCCCCTGACCT f 

3 AACGGACCCTGAATGATAGCCGCTTGCTGCGGGAGGACGTGGAGTTCCAGCCCCTGACGC 4 

2 TGAATGCACTATTGGCTGTGGGGCCTGGGGCAG GAGAGGCCCAGGGCG f 

3 TGATGGTGCTGGTGGGGCCCGGGGCTGGCGGGGCCGCAGGCAGCAGCGAGACGCAGCGCG 4 


3739 
5053 
3799 
5113 
3859 

3919 
5161 
3979 
5221 
3 TGCCCGTGAAGGTCCTAGACTGTGACACCATCTCCCAGGCAAAGGAGAAGATGCTGGACC J 
3 TGCCAGCCCGGGTGCTCGACACGGACACCATCACCCAGGTCAAGGAGAAGGTGTTGGACC < 
3 AGCTTTATAAAGGAGTGCCTCTCACCCAGCGGCCAGACCCTCGCACCCTTGATGTTGAGT J 

3 AAGTCTACAAGGGCACCCCCTTCTCCCAGAGGCCCTCAGTGCATGCCCTAG < 

3 GGCGGTCTGGGGTGGCCGGGCACCTCATTCTTTCTGACGAGGATGTCACTTCTGAGGTCC J 


AGGGTCTGTGGAGGCGCCTGAACACACTGCAGCATTACAAGGTCCCAGATGGAGCAACTG J 

ACTTGGTCCCAGATGGAGCAACAG < 

TGGCCCTCGTCCCCTGCCT CACCAAGCATGTGCTCCGGGAAAACCAGGATT J 

TGGGGCTCGTCCCTCAGCTGCACCGTGGCAGCACCATCTCCCAGAGCCTGGCCCAGAGAT < 
ATGTCCCTGGAGAGCGGACCCCAATGCTGGAGGATGTAGATGAGGGGGGCATCCGGCCCT J 
GCCCCTTGGGAGAGAACATACCCACGCTGGAGGATGGCGAGGAGGGGGGGGTGTGCCTCT < 
GGCACCTGGTGAAGCCAAGTGATGAGCCGGAGCCGCCCAGGCCTCGGAGGGGCAGCCTTC J 
GGCACCTGGTGAAAGCCACCGAGGAGCCAGAAGGGGCCAAGGTGCGGTGCAGCAGCCTGC < 
GGGGCGGGGAGCGTGAGCGCGCCAAGGCCATCCCTGAGATCTACCTGACCCGCCTGCTGT t 
GGGAGCGCGAGCCAGCAAGGGCCAAGGCCATTCCGGAAATCTACCTCACCCGTCTGCTGT < 
CCATGAAGGGCACCCTGCAGAAGTTCGTGGATGACCTGTTCCAGGTGATTCTCAGCACCA i 
CCATGAAGGGCACGCTGCAGAAGTTTGTGGACGACACCTTCCAGGCCATTCTCAGCGTGA < 
GCCGCCCCGTGCCGCTCGCTGTGAAGTACTTCTTTGACCTGCTGGATGAGCAGGCCCAGC t 
ACCGGCCCATCCCCATCGCCGTCAAGTACCTGTTTGACCTTCTGGATGAGCTAGCAGAGA < 
AGCATGGCATCTCCGACCAGGACACCATCCACATCTGGAAGACCAACAGCTTGCCTCTGA i 
AGCACGGCATCGAGGACCCAGGGACCCTGCACATCTGGAAGACCAACAGTCTGCTGCTGC < 
GGTTCTGGATCAATATAATAAAAAACCCGCAGTTTGTGTTCGACGTGCAAACATCTGATA t 
GGTTCTGGGTGAATGCCTTGAAGAACCCACAGCTCATCTTTGATGTACGGGTGTCGGACA J 
ACATGGATGCGGTGCTCCTTGTCATTGCACAGACCTTCATGGACGCCTGCACCCTGGCCG i 
ATGTGGACGCCATCCTTGCTGTCATCGCCCAGACCTTCATTGACTCCTGTACCACCTCGG J 
ACCACAAGCTGGGCCGGGACTCCCCGATCAACAAACTTCTGTATGCACGGGACATTCCCC i 
AGCATAAAGTGGGCCGGGATTCCCCAGTGAACAAACTGCTCTACGCCCGGGAGATCCCAC J 
GGTACAAGCGGATGGTGGAAAGGTACTATGCAGACATCAGACAGACTGTCCCAGCCAGCG i 
GCTACAAGCAGATGGTGGAGAGGTACTATGCGGACATTCGCCAGAGCTCTCCGGCGAGCT 
ACCAAGAGATGAACTCTGTCCTGGCTGAACTGTCCTGGAACTACTCCGGAGACCTCGGGG 


ACCAGGAGATGAACTCTGCTTTGGCTGAGCTCTCCGGGAACTACACTTCTGCTCCCCACT f 
CGCGAGTGGCCCTGCATGAACTCTACAAGTACATCAACAAGTACTATGACCAGATCATCA f 


5194 
6520 
Db 5255 GTCTGGAGGCTCTGCAAGAACTCTACAACCACATCCACAGGTACTATGATCAGATTATCA 5314 

Qy 6581 CTGCCCTGGAGGAGGATGGCACGGCCCAGAAGATGCAGCTGGGCTATCGGCTCCAGCAGA 6640 

Db 5315 >5t>5':':'':t>;>5a>;>3a>;g,accctgtgggccagaagctgcagctggcctgccgcctgcagcagg 5374 

Qy 6641 TTGCAGCTGCTGTGGAAAACAAGGTCACAGATCTAT 6676 

Db 5375 ti;g';';G';cctggtggaaaacaaagtgactgacctgt 5410 


